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Information encoding bv shortened Reed-Solomon codes 

The present invention concerns communication systems in which the 
data to be transmitted are subjected to a channel encoding in order to improve 
5 the fidelity of the transmission. It concerns more particularly decoding methods 
as well as the devices and apparatuses adapted to implement those methods. 

It will be recalled that so-called "channel" encoding consists, when 
the "codewords" sent to the receiver are formed, of introducing a certain amount 
of redundancy in the data to be transmitted. More particularly, by means of each 
10 codeword, the information is transmitted that is initially contained in a 
predetermined number k of symbols taken from an "alphabet" of finite size q ; 
on the basis of these k information symbols, calculation is made of a number 
n > k of symbols belonging to that alphabet, which constitute the components 
of the codewords: v«(v 0 ,v,, ...,v„_|) (the symbol means "by definition"). 
15 The set of codewords obtained when each information symbol takes some 
value in the alphabet constitutes a sort of dictionary referred to as a "code" of 
"dimension" k and "length" n . 

In particular, certain codes, termed "linear codes" are such that any 
linear combination of codewords (with the coefficients taken from the alphabet) 
20 is still a codeword. These codes may conveniently be associated with a matrix 
H of dimension (n-k)*n, termed "parity matrix": a word v of given length n is 

a codeword if, and only if, it satisfies the relationship: H v T = 0 (where the 

exponent T indicates the transposition); the code is then said to be "orthogonal" 
to the matrix H . 

25 Among these linear codes, certain have the property of being 

"cyclic": for any given codeword (v 0l v,, ...,v w _,), the word (v„_,,v 0 ,v, v„_ 2 ) 

obtained by cyclic permutation is a word of the same code. The words 
belonging to a cyclic linear code may conveniently be represented by means of 

n-l 

polynomials: if the polynomial v(x) = £ V/J r'' is made to correspond to some 

/'=0 

30 word v = (v 0 ,v, v„_,), a given word v belongs to a given cyclic linear code if, 
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and only if, the corresponding polynomial v(x) is a multiple of a certain 
polynomial g(x), of degree (n-k) and divisor of termed "generator 

polynomial" of the cyclic linear code which characterizes it. Moreover a parity 
matrix H can be associated with any generator polynomial g(x) . 
5 At the receiver, the decoding method correlative to a given encoding 

method judiciously exploits the redundancy incorporated in the codewords to 
detect any transmission errors and if possible to correct them. There is a 
transmission error if the difference e between a received word r and the 
corresponding codeword v sent by the transmitter is non-zero. 
1 0 More particularly, the decoding is carried out in two main steps. 

The first step consists of associating an "associated codeword" with 
the received word. Conventionally, the decoder first of all calculates the vector 

of "error syndromes" H ■ r = H ■ e . If the syndromes are all zero, it is assumed 
that no transmission error has occurred, and the "associated code word" will 

15 then simply be taken to be equal to the received word. If that is not the case, it 
is thereby deduced that certain symbols in the received word are erroneous, 
and a correction algorithm is then implemented which is adapted to estimate the 
value of the error e ; the algorithm will thus provide an estimated value e such 
that {r -i ) is a codeword, which will then constitute the "associated codeword". 

20 The second step simply consists in reversing the encoding method. 

In the ideal situation in which all the transmission errors have been corrected, 
the initial information symbols are thereby recovered. 

The purpose of an error correction algorithm is to associate with the 
received word the codeword situated at the shortest Hamming distance from 

25 that received word, the "Hamming distance" being, by definition, the number of 
places where two words of the same length have a different symbol. The 
shortest Hamming distance between two different codewords of a code is 
termed the "minimum distance" d of that code. This is an important parameter of 
the code. More particularly, it is in principle possible to find the position of the 

30 possible errors in a received word, and to provide the correct replacement 
symbol (i.e. that is identical to that sent by the transmitter) for each of those 
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positions, each time the number of erroneous positions is at most equal to 
INT[(oM)/2] (where "INT" designates the integer part) for a code of minimum 
distance d (for certain error configurations, it is sometimes even possible to 
achieve better). However, in all cases, the concern is not with a possibility in 
5 principle, since it is often difficult to develop a decoding algorithm achieving 
such performance. 

The present invention concerns in particular "Reed-Solomon" codes, 
which are reputed for their efficiency. These are cyclic linear codes, of which the 
minimum distance d is equal to ( « - k + 1 ). In general terms, codes of dimension 

10 k and of length n having a minimum distance d = n-k + ] are termed "MDS" 
(for "Maximum Distance Separable") since they are codes of which the 
minimum Distance allows a Maximum Separation between the codewords, 
account taken of the parameters k and n). 

When the size q of the "alphabet" is a power of a prime number, the 

15 alphabet can be given the structure of what is known as a "Galois field" denoted 
F q , of which the non-zero elements may conveniently be identified as each 

being equal to y' -1 for a corresponding value of /', where /' = 1,...,q-l, and 
where y is a primitive (g - l) th root of unity in F q . 

The parity matrix H of the Reed-Solomon code of dimension k and 
20 of length n (where n is necessarily equal to (q-\) or a divisor of (q-l)) is a 
matrix with m = n-k lines and n columns, which may be defined for example 
by taking H & - a (i+l)J (0 < / < m-\ , 0 <j < where a is an n ,h root of 

m 

unity in F Q . The generator polynomial of this code is g(*) = J"](x-a'). 

/=l 

Among the algorithms known for encoding a sequence 
25 q = (a Qi a h of information symbols belonging to F q by means of a Reed- 

Solomon code, certain use that parity matrix H. In these algorithms, a certain 
relationship is chosen between the information symbols and those of the 
corresponding codeword v (for example v^ag for 0</<£-l ; in this case, 
the encoding is said to be M systematic ,f ). Next the components of v remaining to 



WO 2005/(10890(1 



PCT/IB2004/002623 



14 



v*(x) = a(x)-r(x)^ £v*,jc' , 



corresponding to the word v* = (v* 0 ,v* l> ...,v*„._i) belonging to C. It should be 
noted that, by construction, the contribution of a(x) to v*(x) only contains 
powers of x belonging to s (the smallest being greater than or equal to s m ). 

5 The two main cases are then to be envisaged for the fourth step of 

the method according to the invention. 

If the m first integers of s comprise all the successive integers from 
0 to (m-\), then the contribution of r(x) to v*(x) only contains powers of x 
belonging to s (the largest being less than or equal to = m - 1 ). In this 
1 0 case, it suffices to take v' = v * . 

On the other hand, if there is at least one "hole" in the succession of 
the m first integers of s (and thus s m _ x >m-\, s m >m, and so forth), 
calculation will now be made of a word v' satisfying: 
• v '/ = v */ f° r s m < i '< m'-I (contribution of a{x) ), and 
15 • v'i = c h in which the c, are predetermined constants, for all i<s m not 
belonging to s , that is to say different from s 0 , s,, .... s m _ ] ; these values of 
/ are thus (s m -m) in number. 

It will be noted that v*,=0 for m<i <(s m -1); but, in the case 
considered in which s m ^ >m-\, v' ; =0 will not necessarily be satisfied for 
20 those indices, even when c, =0. This is because, according to the invention, a 
word v* belonging to C is obtained by linear combination of v * with e , where 
e = s m -m, judiciously chosen words r J belonging to C: 



d) 



in which the coefficients fj belong to F Q . Explicitly, this means: 



25 • for s m <i<n'-\ : v', = v*,- , 



(1a) 
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s m -m-\ 



• for m<i<s m : £ / y r>, , and 

y=0 

5,„-m-I 

• for 0< /</«: v' / = v* / + £ /W/ . 

y-o 



db) 



(1c) 

The words r J are constructed from the coefficients of 

ro-1 

*W s Z • Mor e Particularly, the words V J are defined as follows" 

n = 0 



5 for j<i<j + m : r>, = g,_ y , and (3a) 

- r y , = 0 otherwise. /ou . 

(3b) 

Explicitly: 

C° = (#o.£i £m.0,...,0) , 

C* =(0.go.^i.-.g OT ,0 0) , 

10 

T e 1 =(0,..., 0,g 0l g, g w ,0,...,0) , in which the index of the last non-zero 

component is / = .y m - 1 . 

The polynomial corresponding to the word r° being identical to 
g(x), it is of course divisible by g( x ), and r° thus belongs to C\ The other 
1 5 words r> just being circular permutations of r° , they also belong to C\ 

It then only remains to find the e coefficients of fj which satisfy the 
system of e equations concerned (in which it is stipulated that v\ = c h as 
mentioned above) among the n' linear equations represented by equation (1). It 
can be shown that, with the definitions (3a-3b), the determinant of that system is 
20 non-zero whatever said equations concerned are; this property is linked to the 
"MDS" character of the code C. 

The method described above will now be illustrated by means of a 
numerical example. 
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Take, for example, for code C, the one which served above by way 
of example, to illustrate the Euclidean algorithm (? = 2 4 , k'=\0, n , = q-\ = \s i 
m = 5, and g(x) given by equation (2)). It is desired to shorten it to construct a 
code C of dimension k = 5 and length n = 10; more specifically, suppose that it 
5 is wished to obtain v',- = 0 for / not belonging to 
£ = (1,2,4,5,7,8,10,1 1,13,14) . 
Thus, s m = 8, and e = 8-5 = 3 , and the three indices in question (that is to say 
not belonging to j, and less than s m =8) are: 0, 3 and 6. This then gives: 

r°=(l,Y,y 5 ,Y 2 ,y 7 ,l,0 0) , 

10 r l =(0,l,y,y 5 ,y 2 ,y 7 ,l 1 0 0) , 

r 2 =(0,0,l,y,y 5 ,y 2 , y 7 ,l,0 0) , 

and thus, in particular: 

r°o=o,r 0 3=y 2 ,r 0 6=o, 
r'o=o, r' 3 =y 5 , r' 6 =l,and 
15 r 2 0 =o, r 2 3 =y, r 2 6 =y 7 . 

Let for example the following information sequence be encoded 

2 = (y 9 .o, y ",o,o). 

The following polynomial is then made to correspond to it 
a(x) = Y 11 x" +Y V, 
20 and Euclidean division by g(x) gives: 

q{x) = y "x 6 + y 3 x 5 + y 9 x A + y 6 x 3 +y ]4 x 2 + y 13 + y 8 , and 

r(x) = yx 4 + yx 3 + y ]3 x 2 + y l0 x + y 8 . 
Consequently, 

v*(x)*r ,, x l, +yV+ T r 4 +Tx 3 + y ,3 x 2 +y ,0 x + T « . 
25 and thus: v * 0 = y 8 , v * 3 = y (naturally v * 6 = 0 ). 

Solving the 3 equations (1) then gives: 
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/o=y 8 ./«=y 2 ./2=Y , °. 

Finally, the following is obtained: 

v' = v^ y 8^^y 2 r^y l0 ^ 2 =(o J ^ l2 ,o,l,o 1 o,y IO ,y 9 ,o,o,y ll ,o,o 1 o). 

In this embodiment, once the encoding is terminated, the encoding 
5 unit 102 transmits the "pre-encoded" words v' to a shortening unit 20, which 
deletes the components of v' of which the index does not belong to the set s . 
Thus the words v belonging to the shortened code C are obtained. 

Thus, in the numerical example which has just been considered, the 
following is obtained: 

10 v = (Y 14 .y 12 ,l,0,Y ,0 ,y 9 ,0,y ,, ,0,o). 

It is clear, in view of the above account, that the lower the value of e , 
the lesser will be the calculations implied by the method according to the 
invention. It will now be shown how it is possible to minimize the value of e 
according to a refinement to the invention, for given s. 

15 As was shown above, the value of e is linked to the existence of 

"holes" in a succession of m consecutive elements of s. The refinement in 
question thus consists of searching in s for the sequence of m consecutive 
elements having the least "holes", and of bringing that sequence to the beginning 
of the words of the code C, taking advantage of the cyclic character of that code. 

-° This strategy will be better understood with the aid of a numerical 

example. Let us resume the example in which the polynomial generator is given by 
equation (2), but this time a set of predetermined positions will be considered 
which is given by: 

s = (1,4,6,7,8,9,1 1,1 2,1 3,1 4) . 
!5 Her e s m =s 5 =9, and thus the strict application of the method 

described above leads to forming the following polynomial from an information 
sequence a of length 5: 

a(x) = a 0 x 9 +a 2 x n +a 3 x 13 +a 4 x ]4 . 

Euclidean division makes it possible to construct the word 
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= (~ r 0 • " r \ ~ r 2 - r 3 ~ r 4 . 0,0,0,0, a Q ,0, a, , a 2 , a 3 , a 4 ) , 
in which it is proposed, for example, to transform the e = 4 components v* 0 , 
v* 2 , v* 3 , and v* 5 into constants which are all zero in the manner taught by 
the invention. 

5 It will now be shown that there is another manner of performing the 

calculations, which is appreciably less complex since it leads to a lower value of 

e, i.e. e* = 1. 

Indeed, it is noted that, in s , the sequence of 5 consecutive positions 
8,9,11,12,13 

10 has only a single "hole", i.e. position No. 10. Thus, if on all the words of code C, 
a circular permutation of 8 positions to the left is performed, the initial position of 
that sequence, i.e. position No. 8, is brought to initial position No. 0. Overall, s 
is thus transformed into 

^ = (0,1,3,4,5,6,8,11,13,14) , 

15 recalling that the positions of components are defined modulo «'= 15. Thus, in 
s * , the sole "missing" position below s * m = s * 5 = 6 is position No. 2, which 
corresponds indeed to e* = 1 . 

In polynomial "language", this circular permutation of 8 positions to 
the left corresponds to a multiplication by x~ s modulo (x ]5 -1). Now make the 

20 following polynomial correspond to the information sequence a : 

a(x) = a Q x + a x x A +a 2 x 6 +a 3 x 7 +a 4 x 14 , 
and define 

a * (x) = X - B a(x) = a 4 x 6 + a 0 x 8 + a ]X l 1 + a 2 x 13 + a 3 x ]A . 
The Euclidean division of a*(x) by g(x) then gives a remainder 

25 r*(x) = r* 4 x 4 +r*ix 2 +r* 2 x 2 +r* { x + r* 0 . 

It is thereby deduced that v * (x) = a * {x) - r * (x) , hence 

= (" r *0 - r * i -r * 2 -r * 3 -r % , 0, a 4 ,0, a 0 ,0,0, a x ,0, a 2 , a 2 ) . 
In this word v * , solely the component v * 2 = -r * 2 is both non-zero 
(in general) and absent from s* . The technique taught according to the 
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invention is next implemented, now with e* = 1, to deduce from v* a codeword 
v' having v' 2 = 0 . 

It remains nevertheless to apply a circular permutation of 8 positions 
to the components of this word v\ this time to the right, to get back to the 
5 positions prescribed by s . In this way a word of C is obtained which is the final 
result of the encoding according to the invention of the information sequence a . 

The words v issuing from the shortening unit 20 are finally 
transmitted by the transmission unit 103 to a predetermined recipient. This 
recipient may for example form part of a complex encoding system (for example 
10 relying on a multiplicity of shortened Reed-Solomon codes). This recipient may, 
according to another example, be a transmission chain comprising a modulator, 
which associates a modulation symbol with each predetermined number of 
binary symbols (bits), followed by a recorder or else (respectively) by a 
transmitter inserting the symbols in a transmission channel, that channel for 
15 example able to be a storage on a suitable carrier (such as a DVD, or a 
magnetic or magnetico-optical disc, or else magnetic tape), or (respectively) 
transmission by wire or wireless transmission (such as a radio link). 

The block diagram of Figure 2 represents, very schematically, a data 
processing apparatus 48 incorporating the encoder 102. 
20 Tnis apparatus 48 comprises a keyboard 911, a screen 909, a 

source of external information 100, a transmitter 103, conjointly connected to 
input/output ports 903 of an encoding device 102 which is implemented here in 
the form of a logic unit. 

The encoding device 102 comprises, connected together by an 
25 address and data bus 902: 

- a central processing unit 900, 

- a random access memory RAM 904, 

- a read only memory 905, and 

- said input/output ports 903. 

30 Each of the elements illustrated in Figure 2 is well known to a person 

skilled in the art of microcomputers and transmission systems and, more 
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generally, of information processing systems. These known elements are 
therefore not described here. It should be noted, however, that: 

- the information source 100 could, for example, be an interface 
peripheral, a sensor, a demodulator, an external memory or other information 

5 processing system (not shown), and could for example supply sequences of 
signals representing speech, service messages or multimedia data in particular 
of IP or ATM type, in the form of sequences of binary data, and 

- the transmitter 103 is adapted to transmit the words belonging to code 
C, for example to a unit belonging to a complex encoding system, or to a device 

1 0 for sending on a radio channel or for recording on a carrier for mass storage. 

The random access memory 904 stores data, variables and 
intermediate processing results, in memory registers bearing, in the description, 
the same names as the data whose values they store. It should be noted, in 
passing, that the word "register" designates, throughout the present description, 
15 a memory area of low capacity (a few items of binary data) and equally well a 
memory area of high capacity (for storing a complete program) within a random 
access memory or read only memory. 

The random access memory 904 contains in particular the following 

registers: 

20 - a register "information_symbols" in which the information symbols 

belonging to F q are stored, 

- a register "pre-encoded_words", in which the words v' belonging to 
the non-shortened code are stored, and 

- a register "code_words", in which the words v belonging to the 

25 shortened code are stored, before being submitted to the transmitter 103. 

The read only memory 905 is adapted to store, in registers which, for 
convenience, have the same names as the data which they store: 

- the operating program of the central processing unit 900, in a register 
"program", 

30 - the length «' of the words belonging to the non-shortened code, in a 

register V ", 
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- the length n of the words belonging to the shortened code, in a 
register 'V\ 

- the set s of the positions of the components to keep after shortening, 
in a register V, 

5 - the cardinal of the Galois field serving as alphabet for the code 

used, in a register "cf\ 

- the number k = n-m of information symbols serving to construct a 
codeword, in a register "K\ and 

- the coefficients of the polynomial generator g(x) of the non-shortened 
10 code, in a register "gr"- 



